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A mathematical review on the multiple-solution problem* 



K. Zhu? X. H. Mo, C. Z. Yuan, P. Wang 
Institute of High Energy Physics, Chinese Academy of Sciences, 
P.O. Box 918(1), Beijing 100049, China 



The recent multiple-solution problem in extracting physics information from a fit to 
the experimental data in high energy physics is reviewed in a mathematical viewpoint. 
All these multiple solutions were found via a fit process previously, while in this letter we 
prove that if the sum of two coherent Breit-Wigner functions is used to fit the measured 
distribution, there should be two and only two non-trivial solutions, and they are related 
to each other by analytical formulae. For real experimental measurements in more com- 
plicated situations, we also provide a numerical method to derive the other solution from 
the already obtained one. The excellent consistency between the exact solution obtained 
this way and the fit process justifies the method. From our results it is clear that the 
physics interpretation should be very different depending on which solution is selected. 
So we suggest that all the experimental measurements with potential multiple solutions 
be re-analyzed to find the other solution because the result is not complete if only one 
solution is reported. 
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1. Introduction 

Interference as a nature phenomenon has been observed for a very long time in situ- 
ations where waves intersect, no matter the mediate material is water, string, sound, 
or light. It has been studied in depth and also widely used in a range of physical 
and engineering measurements. However, classic physics and quantum mechanics 
provide basically different explanations of this phenomenon. In classic physics, if 
two meeting waves are considered contributing to a process, what observed is just 
the sum of amplitudes of two waves, i.e., A(x) = A\{x) + ^(a;), where a; is a gen- 
eralized coordinate which could be position, momentum, time, energy, etc. But in 
quantum mechanics, their wave functions are summed to obtain the total amplitude 
(generally there are relative phases between them), i.e. \ip(x)) = |a(x)) + |6(a;)). And 
the experimentally measured quantities are usually proportional to the modulus of 
the amplitude squared, and thus one generally has contribution from an interference 
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ticle DPI 10.1142/S0217751X11054 589] [copyright World Scientific Publishing Company] 
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33 term (a\b). Compared with the classic ones, many new and fantastic features are 

34 caused by this additional interference term (a\b), and the ambiguity in extracting 

35 information from observation is one of them. 

36 Usually the experimental quantities depending on l^ 2 are measured, and from 

37 which we extract the information of the amplitudes \a) and \b). Unlike in the clas- 

38 sic physics case, as there is a square operation between the observable and the 

39 amplitudes, we would expect other solutions, \a'} and \b'), be found in extract- 

40 ing amplitudes from physics measurements. It is true that the existing freedom on 

41 the global phase is non-relevant to the physics in this extraction procedure. How- 

42 ever, more and more experimental analyses presented recently imply that different 

43 solutions with different relative phases would lead to non-trivial different physics 

44 interpretations. 

45 Some earlier examples reporting multiple solutions arc in the study of th e so- 

46 called Y states via initial state radiation (ISR) by the Belle experimental. The 

47 invariant mass distributions of tt + tt~ J/tp and 7r + 7r _ 0(2S') arc fitted with two co- 

48 herent resonant terms and an incoherent background term. Another example is the 

49 study of the decay dynamics of rf — > 77r + 7r - mode. When the 7r + 7r _ invariant 
so mass distribution is fitted with a coherent sum of the p resonance and a contact 

51 term, two solutions arc found with one solution corresponding to constructive in- 

52 terference between the two amplitudes while the other corresponding to destructive 

53 interference.^ Some recent examples are presented in Refs. |4] and 03 In Ref. Ell two 

54 solutions are found for both the branching fraction measurement of <fi — > ujtt and 

55 the p — lu mixing study. In Rcf. 5, four sets of solutions are found in fitting the 

56 i?-values to extract the resonant parameters of the excited states, namely the 

57 0(4040), 0(4160), and 0(4415). 

5a However, it is notable that in Refs. CQ-[S]all the multiple solutions are found via 

59 fitting process. And we know fit method always suffers from background uncertain- 

60 ties and limited statistics. Then some interesting questions are raised naturally such 

61 as: whether they are exact solutions or only approximate results due to the short- 

62 comings of the fit process or statistical fluctuation; whether these solutions always 

63 exist or just appear in some special cases; how many fold ambiguities there are if 

64 multiple-solution exists; and if one special solution has already been found, whether 

65 the others can be derived from it. Some of these questions are explored from physics 

66 point of view in Refs. [4] and [5j and mathematical attempts are described in Ref. [7] 

67 and El Following the clue of these above mentioned studies we perform the present 

68 investigations in this paper. Comparing with Ref. [7] in which Fourier transforma- 

69 tion is applied, the solution finding process is extremely simplified in this study; 

70 comparing with Ref. El more general conclusions are obtained in this analysis. 

71 In section [21 at the outset, a general and mathematic model for the sum of two 

72 amplitudes is established on the basis of the known facts from physical analyses. 

73 If two amplitudes are both the commonly used Breit-Wigner (BW) functions, the 

74 analytical expression for the two solutions are obtained. Moreover, an effective ap- 
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75 proach is developed for deriving the algebra equations related to the solutions. Then 

76 many double-solutions are deduced for distinctive forms of amplitude functions. Af- 

77 ter that, we put forth a constraint on the ratio of the two amplitude functions, which 
7s ensures that there will be non-trivial double solutions. In section |3l we use a toy 
79 numerical example to check and confirm our results. We also develop a numerical 
so procedure to get the unknown solution from the known one when the form of the 
si amplitude function is extremely complicated. Finally there is a short discussion on 

82 the consequence of having multiple solutions in extracting physics information from 

83 the experimental data, and we also present our suggestion on how to handle this 

84 situation. 

85 2. Mathematical methodology 

86 If scrutinizing the relevant analyses with multiple solutions HHS] we can note two 

87 prominent characteristics: 1. all set of solutions have equal goodness-of-fit; 2. al- 

88 though all parameters including the masses, the total widths, the partial widths 

89 and some other related parameters are allowed to float in the fit, it is observed that 

90 the differences between multiple solutions are only in the partial widths and the rel- 

91 ative phases between the amplitudes. The first point indicates that all solutions are 

92 mathematically equivalent while the second point implies that the main difference 

93 for different solutions is in the normalization factor and the relative phase between 

94 them. In the light of these experimental facts, we abstract a general mathematical 

95 model for multi-solution problem. Without losing generality, the study that follows 

96 focuses on the case of two amplitude functions. 

97 2.1. Solutions for two BW amplitudes 

9a Generally, a sum of two quantum amplitudes can be described by a complex function 

99 e{x, Zi, z%) with form 

100 e(x, z 1 ,z 2 ) = zi g(x) + z 2 f(x) , (1) 



101 

102 
103 



10b 
106 
107 



where g(x) and f(x) are both complex functions, x is a real variable, and z\, z 2 are 
complex numbers. Our goal is to find non-trivial different scries of parameters z[ 
and z 2 that satisfy 

\e(x, zi,z 2 )\ 2 = \e(x,z[,z' 2 )\ . (2) 

Noticed that the global phase plays no role in the amplitude squared, we can reduce 
the dimension of z\ — z 2 parameter space to a z — d space in which d is real number, 
and re-write \e(x, Zi, z 2 )\ 2 to a more convenient form by defining 



\e(x,z u z 2 )\ = - \g{x) +z f(x)\ = — — 



1 + ;/<*> 



2 



^\l + Z F(x)\ 2 J-^E(x,z). (3) 
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Here Fix) = f(x)/g(x) and E(x, z) = \\-\-z F{x)\ 2 . Since |(?(a;)| 2 is only a multiply 
factor and is independent of d and z, it can be dropped in the following discussion. 
Now we only focus on finding different series of d and z that keep E(x,z)/d un- 
changed. Denoting the real and imaginary parts of F(x) with Rf(x) and If(x), 
as well as R z and I z for z, respectively, and expressing E{x, z) by these real and 
imaginary components, we obtain 

E{x, z) = {R 2 F + I 2 F ){R 2 Z + I 2 ) - 2I F I Z + 2R F R Z + 1 . (4) 

For compactness, the explicit dependence on x of Rf{x) and If(x) is removed here. 
Without losing generality, set d = 1 as an initial solution for convenience, so our 
task is to find all possible d' and z' to render E(x,z')/d' = E(x,z). To specialize 
our work, we consider the case when both gix) and fix) are non-relativistic BW 
functions^ 

9(x)- 



(x - M g ) + iT a ' 



/(*)" 



{x-Mf) + iT } 

where M and Y are the mass and width of a resonance, respectively. This BW-form 
amplitude function is chosen because it's broadly adopted in high energy physics. 
With the above forms of g{x) and /(x), the real and imaginary components of Fix) 
are 

n _ r f \r g r f + (M g -x)(M f -x)] 

T g [T} + (M f -xr] 

and 

j _ l'/T/(.H, r. I ,:.W ; ,-; 
T a [T) + (M f -xY] 

After some algebra, we get an interesting relation 

R 2 f + I 2 f = clRf + bI F + c, (5) 

with 



r r r 

1 a 1 a 1 a 



(6) 



With Eq. ([5]), E(x,z) is recast as 

Similar expression can be obtained for E(x, z'). Notice that Rp and Ip are functions 
in variable space (x space), while R z and I z are functions in parameter space (z — d 
space), if we want E(x, z')/d' = E(x, z) for any x, then the corresponding functions 
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wo in parameter space (the coefficients of the functions in variable space) should be 
141 equal. This requirement immediately yields 

aR 2 z , + al 2 , + 2R Z > = d'(aR 2 z + aP z +2R Z ) , 
bR 2 z , + bl 2 - 21 z> = d'(bR 2 z + bl 2 - 21 z ) , (8) 
cR 2 z , + cl 2 + 1 = d'{cR 2 z +cl 2 + l). 

143 In the light of the set of equations, it turns out that d 1 must satisfy a second order 

144 equation and there are two roots of it. One is the trivial solution with d' = 1 and 

145 z' — z correspondingly, and the other one is 

a 2 + b 2 + 4c 



d' = 



(a-2R z c) 2 + (b + 2I z c) 2 



M-^W«+^H (9) 



147 2.2. Some special solutions 

us In this section we consider some other forms for f(x) and g(x). The first form is 

149 from the KLOE experiment on e + e~ — > lott with both oj — > ir + ir~ir° and lo — > 777°. 

150 The cross section of e + e~ — > ljtt in the vicinity of the (j> resonance, as a function 

151 of the center-of-mass energy, -y/x, is parameterized as 

2 



<j{\/x) = o nr {\[x) 



(10) 



153 in Ref. [TUl where <r nr (y/x) = <jq + <r'{^fx — M4) is the bare cross section for the 

154 non-resonant process, parameterized as a linear function of yfx; M<p, 1^, and = 

155 M 2 — x — iM^Ttf, are the mass, the width, and the inverse propagator of the <j> 

156 meson, respectively. Here z is a complex number which depicts the interference 

157 effect. Comparing with definition of E(x,z), f(x) and g{x) have the forms 

i5s g[x) = -1 J{x) - 



Ml-x-iM^Y^ ■ 

The simple algebra yields R 2 F + Ip = —Ip, which in turn gives 

R% + I 2 + 21 z , = d'(R 2 + I 2 + 21 z ) , 

R»> = d'R z , (11) 
1 = d' . 

With the last equality d' = 1, the relation R z / = d'R z implies R z > = R z . then the 
first equation provides the other non-trivial solution 

z =R Z - i(I z + 2) . 

They are just the results acquired in Ref. IS by another method. 
As the second example, we consider the form 

<?(*) = -, /(*) = — X — -r-r, (12) 

X TO — X + ITO.1 
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W7 which is usually used to extr act the resonant information of oj in fitting the data 
lea of e + e~ — ?> 7r + 7r~ cross sectioiffH Here m and T indicate the mass and total decay 

169 width of the resonance. Accordingly, we obtain R 2 F + I F = —Hp + (If(( = — m/T), 

170 which in turn gives 

R\, + P z , - 2R Z , = d'(R 2 z + P z - 2R Z ) , 
C(R% + I 2 ,) - 21 z , = d%{Rl + I 2 ) - 2I Z ] , (13) 
1 = d' . 

172 After some algebra, we get the other non-trivial solution 

, 2 + (c 2 - ir - 2q, 2gi~R z )-(t 2 -i)i z 
z = iTe + iTc 2 ■ ( } 

174 We consider a more general case, when f(x) and g(x) are any non-trivial 

175 functions, but their ratio must ensure that the real or imaginary component of 

176 F(x) is constant^. In this case, there exist two solutions. Specially speaking, when 

177 F(x) — k + ih{x), with h(x) being a non-trivial function and k is a non-zero real 

178 constant. Here 

E(x, z) = h 2 (x)(R 2 z + I 2 ) - 2h(x)I z + k{R 2 z + I 2 ) + 2kR z + 1 . (15) 

From E(x, z')/d' = E(x, z) and we require all coefficients of each order of h(x) being 
equated respectively, i.e. 

R 2 Z ,+I 2 =d'(R 2 z +I 2 ) , 
-2I Z , = d'(-2I z ) , 
n 2 (R 2 z , +I 2 ) + 2kR z , + 1 = d'[n(R 2 z + I 2 ) + 2kR z + 1] . (16) 

Besides the trivial solution d! = 1 and z' = z, there exists the other solution to 
above set of equations, 

d> = 



179 
180 



184 

185 



4k 2 (R 2 z + I 2 ) +4kR z + 1 ' 
z' = -d'[2n{R 2 z + I 2 ) + R z ] + i{I z d!) . (17) 

When F(x) = h(x) + in, via similar derivation, the other non-trivial solution is 
obtained as 

i 

d! 



A K 2 {R 2 + I 2 )-AkI z + 1 ' 
z = R z d! + id'[2n{R 2 z + I 2 ) - I z ] . (18) 

There are also other cases where there are two solutions, such as when Rp is a 
linear function of If, or vice versa, we will not discuss them in details here. 

a This can be realized, for example, if f(x) = p f (x)e lB f ^ and g(x) = p g (x)e lS s ( x ) , where 
pf^ g (x) and 9f g (x) are any non-trivial functions. As long as there exist relations pf(x) = 
p g (x) ■ ^/h 2 (x) + k 2 and 9f(x) = g (x) + tan -1 (!i(x)/k) + 2irn (n: any integer), it always has 
F{x) = k + ih(x). 
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is9 2.3. Constraint on the amplitude functions 

wo Despite of the examples shown in the previous section, it is clear that the double- 

191 solution issue is not universal for any forms of the functions. Actually, it is easy 

192 to find some forms of f(x) and g(x), in which no multiple-solutions can be found, 

193 g(x) — x and f(x) = x 3 + ix 2 is such an example. 

194 This is why, although still far from the final answer, we want to discuss what kind 

195 of constraints can be applied to the functions if double-solutions exists, i.e. what 
we kind of amplitude functions, f(x) and g(x) in the preceding section, will guarantee 
197 double solutions. When we return to the study of two BW amplitudes, we notice 
i9s that the relation in Eq. ([5J is crucial for obtaining the double-solutions, and this 

199 relation provides a constraint on F(x). It is easy to check that all the special forms 

200 with double-solutions found by us, obey this requirement. In short: there will be 

201 two solutions if the Argand diagram of F(x) is a circle. 

202 3. Check and Application 

203 As a cross check, let's consider an ad hoc example: the parameters of the two BW 

204 functions and one solution are set as 

M g = 3.0, T 3 =0.4, M f = 2.1, r / = 0.1, z = l-i. 

206 Using the aforementioned method, we can find another solution, which is exactly 

207 repeated by fitting with maximum likelihood method. The comparison of the results 
2oa is shown in Table [TJ 

Table 1. Comparison between exact solution and that obtained 
from fit process in the case of summing up two simple BW func- 
tions. For the fit, toy MC is used to generate a 10,000 events data 
sample. 



Item 


Input 


The other sol. 


Fit I 


Fit II 


d 


1 


0.529 






R z 


1 


0.647 


1.019 ±0.054 


0.644 ± 0.040 


h 


-1 


1.588 


-1.019 ±0.060 


1.601 ± 0.028 


M 3 


3.0 


3.0 


3.011 ±0.010 


3.011 ± 0.010 


r 9 


0,1 


0,1 


0.402 ±0.010 


0.402 ± 0.010 




2.1 


2.1 


2.101 ±0.003 


2.101 ± 0.003 




0.1 


0.1 


0.101 ±0.003 


0.101 ± 0.003 



209 This example indicates that in principle, the fit procedure can be used as a 

210 feasible approach to find the multiple solutions from the experimental data. 

2n It is obvious that, for the two BW amplitudes case, if one solution is obtained 

212 by fitting to the data, the other one can be readily and analytically obtained by 

213 applying Eq. ([9]). This definitely saves a lot of time and energy. However, due to the 

214 complexity of the expressions in practice, the solution has to be obtained from the 
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215 
216 
217 
218 
219 
220 
221 
222 
223 
224 
225 
226 

227 

228 
229 
230 
231 

232 

233 
234 
235 
236 
237 
238 
239 
240 
241 
242 
243 
244 

245 

246 
247 
248 
249 
250 
251 
252 



following numerical method. Firstly, we draw F(x) in the complex plane to check 
whether it is a circle. If the answer is yes, we need to determine the parameters a, 
6, and c in relation R 2 F (x) + Ip(x) = aRp(x) + blp{x) 4- c. This can be achieved by 
randomly selecting three points on the curve, i.e. set x in F(x) with three randomly 
chosen values, to obtain three linear equations with a, 6, and c as variables, whose 
values can be got by solving the set of equations. This process should be realized 
numerically when F(x) takes a very complicated form in practice. With a, &, and 
c obtained, we can derive the other solution with Eq. (J9j) as was shown before. We 
illustrate this method by the examples which are selected from the initial state radi- 
ation measurements at BaBar and Belle™, where the iy + tt ip(2S) and n + n J/ip 
invariant mass distributions are described by two coherent resonances. The cross 
sections are formulated as 



where BW± and BW2 represent the two resonances and is the relative phase 
between them. Then in our frame BWi, BW2, and e 1 ^ represents g(x), f(x), and 
z respectively, and F(x) = BW2/BW1. And the BW form of a single resonance in 
these two papers is 



where M is the mass of the resonance, T to t and T e + e - are the total width and 
partial width to e + e~, respectively, B(R — > /) is the branching fraction of R decays 
into final state /, and PS(s) is the three-body decay phase space factor. Noticed 
here that F(x) is very complicated. We draw it in the complex plane and find it is 
in a good circle shape, then we can obtain the parameters a, 6, and c from this plot. 
Using Eq. (jSJ) and the first solution as input we obtain the second solution as shown 
above; or reversely, using the second solution as input to obtain the first one. All the 
results from our method and comparisons with the experimental fits are shown in 
Table where only the statistical errors are quoted for experimental measurements 
and only the central values are used as input to get the other solution. From the 
Table it is clear that our results reproduce the results from the fit process very well, 
and we consider this as a justification of our method. 

4. Discussion 

As been found, when the measured distribution is described by \g{x) + zf(x)\ 2 /d 
and F{x) — f(x)/g(x) fulfills the relation of Eq. ([5]), i.e., F(x) is a circle in complex 
plane, there are and only are two non-trivial solutions. It has also been proved that 
if f(x) and g(x) are both simple BW functions, this relation is exactly satisfied 
and Eq. ([9]) can be utilized to derive the other solution analytically from the one 
obtained from the fit. For other transmogrified BW functions, which could be ex- 
tremely complicated, the relation Eq. ([5]) still hold for F(x) by numerical checking. 




October 14, 2011 0:22 WSPC/INSTRUCTION FILE draft 'multi'solution 



9 



Table 2. Comparison of the exact solutions with those obtained from the fit process for two real experimen- 
tal measurements. For experimental results, only statistical errors are quoted; for our method, solution I is 
obtained with solution II as input, and vice versa. 



Solutions 


I 






II 






Items 


BT e+e -{Rl) 


BF e+e -(R2) 




BT e+e -(Rl) 


BF e+e -(R2) 




fit results in Ref. [T] 
by our method 


5.0 ± 1.4 
4.7 


6.0 ± 1.2 
6.4 


12 ±29 
12 


12.4 ±2.4 
12.8 


20.6 ±2.3 
20.4 


-111 ± 7 
-111 


fit results in Ref. Q_] 
by our method 


11 1+ 1 ' 3 

ll.-l._j 2 
11.1 


9 9+0.7 
z - z -0.6 
2.1 


-12+23 
18 


12.3 ± 1.2 
12.3 


5.9 ± 1.6 
6.0 


'*-12 
-74 



253 So there will be double solutions for these forms too and with a, b, and c obtained 

254 numerically the other solution can be derived by using the same method. The ex- 

255 cellent consistency between our exact solutions and experimental fit results justifies 

256 this method. We should mention that when we use the experimental fit results as 

257 input, only the central values are considered to prove the method is valid. In princi- 

258 pie, one can obtain the errors of the parameters in the second solution given the full 

259 covariant matrix is available for the first solution (unfortunately, very often, exper- 

260 imcntal papers only give diagonal errors, and the correlations between the variables 

261 are not reported). The uncertainty of the numerical method is not discussed in case 

262 of complicated BW forms, as the main purpose of this paper is to examine whether 

263 the other solution exist or not, and how to find it. 

264 We also notice that for both solutions, the parameters of each resonance are same 

265 but the normalization factors. This implies that the couplings to decay channels are 

266 different for different solutions and some experimental reports may not be complete 

267 if only one solution was reported while there are two-fold ambiguities. So we suggest 

268 any experiment measurement with potential multiple-solution problem redo the 

269 analysis to find out the other solutions. Finally, we should point out that from 

270 Eq. ([4]) we may find more conditions where double solutions exist. For example, if 

271 the real or virtual component of F(x) is zero or the real component of F(x) is a 

272 linear function of the virtual one, there should be double solutions too. However, 

273 they are not normal in high energy physics so we do not discuss them in detail here. 

274 Furthermore, only the sum of two coherent amplitudes is considered in this paper, 

275 the generalization to more amplitudes is still in progress. 
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